| Name | Version | Summary | date |
| unsloth |
2025.10.11 |
2-5X faster training, reinforcement learning & finetuning |
2025-10-28 12:41:47 |
| kymnasium |
1.0.9 |
The collection of reinforcement learning environments developed for the Artificial Intelligence course at the Department of Computer Science and Engineering, Kangwon National University. |
2025-10-26 22:44:04 |
| numberlink |
0.1.6 |
NumberLink puzzle environment for Gymnasium |
2025-10-26 18:28:11 |
| gymnasium-2048 |
0.1.0 |
A reinforcement learning environment for the 2048 game based on Gymnasium |
2025-10-26 08:05:01 |
| LevDoom |
1.0.3 |
LevDoom: A Generalization Benchmark for Deep Reinforcement Learning |
2025-10-25 17:16:51 |
| ai-snake-lab |
0.11.3 |
Interactive reinforcement learning sandbox for experimenting with AI agents in a classic Snake Game environment. |
2025-10-23 03:37:46 |
| ev2gym |
2.0.0 |
A realistic V2G simulator environment |
2025-10-17 22:17:18 |
| PaLM-rlhf-pytorch |
0.7.1 |
PaLM + Reinforcement Learning with Human Feedback - Pytorch |
2025-09-19 17:05:25 |
| xcsf |
1.4.10 |
XCSF learning classifier system: rule-based evolutionary machine learning |
2025-09-11 22:15:27 |
| ReplicantDriveSim |
0.5.9 |
A Unity Traffic Simulation |
2025-09-10 01:29:47 |
| jaxsim |
0.8.0 |
A differentiable physics engine and multibody dynamics library for control and robot learning. |
2025-09-09 12:46:58 |
| adaptiq |
0.12.8 |
An offline Q-learning framework for AI agent prompt optimization. |
2025-09-08 14:36:49 |
| ethical-gardeners |
0.0.1 |
A RL environment for learning ethically-aligned behaviours |
2025-09-08 07:18:02 |
| reaf |
1.0.0 |
Robotics Environment Authoring Framework. |
2025-09-05 11:27:30 |
| commonpower |
0.6.1 |
A package for the exploration of safe single/multi-agent reinforcement learning in smart grids. |
2025-09-01 16:21:44 |
| qrl-qai |
0.1.0 |
Quantum Reinforcement Learning library: environments, policies and training loops with PennyLane. |
2025-08-28 13:23:05 |
| deepcubeai |
0.2.1 |
Learning Discrete World Models for Heuristic Search |
2025-08-27 22:54:38 |
| HASARD |
0.2.0 |
Egocentric 3D Safe Reinforcement Learning Benchmark |
2025-08-26 21:39:27 |
| questions-gen |
0.1.1 |
AI-Powered Mathematical Competition Problem Generation Package |
2025-08-17 13:24:55 |
| brax |
0.13.0 |
A differentiable physics engine written in JAX. |
2025-08-15 18:50:22 |